Understanding Big Data Analytic Workloads on Modern Processors

نویسندگان

  • Zhen Jia
  • Lei Wang
  • Jianfeng Zhan
  • Lixin Zhang
  • Chunjie Luo
  • Ninghui Sun
چکیده

Big data analytics applications play a significant role in data centers, and hence it has become increasingly important to understand their behaviors in order to further improve the performance of data center computer systems, in which characterizing representative workloads is a key practical problem. In this paper, after investigating three most important application domains in terms of page views and daily visitors, we chose 11 representative data analytics workloads and characterized their micro-architectural behaviors by using hardware performance counters, so as to understand the impacts and implications of data analytics workloads on the systems equipped with modern superscalar out-of-order processors. Our study reveals that big data analytics applications themselves share many inherent characteristics, which place them in a different class from traditional workloads and scale-out services. To further understand the characteristics of big data analytics workloads we performed a correlation analysis of CPI (cycles per instruction) with other microarchitecture level characteristics and an investigation of the big data software stack impacts on application behaviors. Our correlation analysis showed that even though big data analytics workloads own notable pipeline front end stalls, the main factors affecting the CPI performance are long latency data accesses rather than the front end stalls. Our software stack investigation found that the typical big data software stack significantly contributes to the front end stalls and incurs bigger working set. Finally we gave several recommendations for architects, programmers and big data system designers with the knowledge acquired from this paper.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Processing Cyclic Multimedia Workloads on Modern Architectures

Working with modern architectures for high performance applications is increasingly more difficult for programmers as the complexity of both the system architectures and software continue to increase. The level of hand tuning and native adaptations required to achieve high performance comes at the cost of limiting the portability of the software. For instance, we show that a compute intensive D...

متن کامل

Intrinsic Data Locality of Modern Scientific Workloads

Understanding the intrinsic data locality of a workload is essential to understanding and predicting cache performance. The intrinsic data locality of a particular application or workload can be measured in a microarchitectureindependent manne,: The data resulting from these measurements ideally can be used to develop an analytic model forpredicting memory performance on different cache sizes a...

متن کامل

NpBench: A Benchmark Suite for Control plane and Data plane Applications for Network Processors

Modern network interfaces demand highly intelligent traffic management in addition to the basic requirement of wire speed packet forwarding. Several vendors are releasing network processors in order to handle these demands. Network workloads can be classified into data plane and control plane workloads, however most network processors are optimized for data plane. Also, existing benchmark suite...

متن کامل

Redoop Infrastructure for Recurring Big Data Queries

This demonstration presents the Redoop infrastructure, the first fullfledged MapReduce framework with native support for recurring big data queries. Recurring queries, repeatedly being executed for long periods of time over evolving high-volume data, have become a bedrock component in most large-scale data analytic applications. Redoop is a comprehensive extension to Hadoop that pushes the supp...

متن کامل

Amoeba: A Shape changing Storage System for Big Data

Data partitioning significantly improves the query performance in distributed database systems. A large number of techniques have been proposed to efficiently partition a dataset for a given query workload. However, many modern analytic applications involve ad-hoc or exploratory analysis where users do not have a representative query workload upfront. Furthermore, workloads change over time as ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015